음악 유사도 비교를 위한 Siamese 네트워크 기반 그래프 임베딩의 개선

송창헌; 이용현; 김형주; Changheon Song; Yonghyun Lee; Hyungjoo Kim

연구문헌

국내 논문지

홈 > 연구문헌 > 국내 논문지 > 한국정보과학회 논문지 > 정보과학회 컴퓨팅의 실제 논문지 (KIISE Transactions on Computing Practices)

정보과학회 컴퓨팅의 실제 논문지 (KIISE Transactions on Computing Practices)

Current Result Document :

한글제목(Korean Title)	음악 유사도 비교를 위한 Siamese 네트워크 기반 그래프 임베딩의 개선
영문제목(English Title)	Improvement of Graph Embedding Based on Siamese Network for Comparison of Music Similarity
저자(Author)	송창헌 이용현 김형주 Changheon Song Yonghyun Lee Hyungjoo Kim
원문수록처(Citation)	VOL 26 NO. 11 PP. 0493 ~ 0498 (2020. 11)
한글내용 (Korean Abstract)	음악 시장의 성장에 따라 사용자는 일부 음악에 국한되어 노출되고 선택하게 된다. 많은 서비스는 메타데이터로 라이브러리를 구성하여 검색 및 추천 문제에 접근하고 있다. 이때, 새로 나오거나 인지도가 없는 음악의 경우 결과에서 제외될 수 있다. 일반적으로 사용되는 오디오 피처는 해상도에 따른 차원의 변화 폭이 크기 때문에 CNN의 입력으로 사용하기에 어려움이 있다. 본 논문에서는 음악 그래프 피처를 추출하고 임베딩하여 유사도를 비교할 수 있는 모델을 제안한다. 모델은 피처 추출과 Siamese 네트워크로 구성된다. 피처 추출에서는 각 음악 신호를 오디오 피처로 변환하고, 각 음악의 그래프 피처를 구성한다. 이후, Siamese 네트워크에서 각 그래프 피처를 GCN과 어텐션 기법을 활용하여 잠재 공간으로 임베딩하고, NTN을 통해 서로 다른 두 벡터의 유사도를 도출한다. 마지막으로 실험을 통해 음악 신호의 유사도 비교를 위한 오디오 피처의 그래프 피처 추출이 효과적인 방식임을 입증하였다.
영문내용 (English Abstract)	As the music market grows, people are exposed to and provided with selective music. Many services use metadata for building music libraries. In this situation, songs from independent labels and new artists that do not have previous information are still excluded from the result of searches and recommendations. In this paper, we focus on making the music scoring model for calculating the similarity score of two music signals. The model comprises the Siamese network and the scoring layer. The Siamese network embeds audios to small latent vectors and passes them to the scoring layer. The audio feature is difficult to use as an input to the CNN because of the dimensionality problem. Our approach is compared to previous works because it retains the sequence information of the peak frequencies in the spectrogram by transforming it into a graph. The effectiveness of the graphical approach is shown as the result of the experiment.
키워드(Keyword)	오디오 컨텐츠 그래프 임베딩 그래프 콘볼루셔널 네트워크 Siamese 네트워크 audio content graph embedding graph convolutional network Siamese network
파일첨부	PDF 다운로드